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ABSTRACT 

The MiN Energy Broadcast problem consists in assign- 
ing transmission ranges to the nodes of an ad-hoc network 
in order to guarantee a directed spanning tree from a given 
source node and, at the same time, to minimize the energy 
consumption (i.e. the energy cost) yielded by the range as- 
signment. MiN Energy Broadcast is known to be NP- 
hard. 

We consider random-grid networks where nodes are chosen 
independently at random from the n points of a y'n x y^n 
square grid in the plane. The probability of the existence of 
a node at a given point of the grid does depend on that point, 
that is, the probability distribution can be non-uniform. 
By using information-theoretic arguments, we prove a lower 
bound (1 — e)^ on the energy cost of any feasible solution 
for this problem. Then, we provide an efficient solution of 
energy cost not larger than 1.1204^. 

Finally, we present a fully-distributed protocol that con- 
structs a broadcast range assignment of energy cost not 
larger than 8n, thus still yielding constant approximation. 
The energy load is well balanced and, at the same time, the 
work complexity (i.e. the energy due to all message trans- 
missions of the protocol) is asymptotically optimal. The 
completion time of the protocol is only an O(logn) factor 
slower than the optimum. The approximation quality of our 
distributed solution is also experimentally evaluated. 
All bounds hold with probability at least 1 — l/n®'-^-'. 

1. INTRODUCTION 
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Range assignments in ad-hoc networks. In ad-hoc net- 
works, nodes are able to vary their transmission ranges in 
order to provide good network connectivity and low energy 
consumption at the same time. More precisely, the trans- 
mission ranges determine a (directed) communication graph 
G{S, E) over the set S of nodes: a node v, with transmis- 
sion range r, can transmit to another node w (so, edge 
{u, w) € E) if and only if w belongs to the disk centered 
in V and of radius r. The transmission range of a node de- 
pends, in turn, on the energy power supplied to the node. 
In particular, the power required by a node v to cor- 
rectly transmit data to another station w must satisfy the 
inequality P„ > dist(u,w)^, where dist(w,w) is the Eu- 
clidean distance between v and w. In several works [11 1121 
1171 124) , it is assumed that nodes can arbitrarily vary their 
transmission range over the "large" set {dist(s, t)|s, i G 5}. 
However, in some important network scenarios (like sensor 
networks), this assumption is not realistic: the adopted tech- 
nology allows nodes to have only few possible transmission 
range values. For this reason, we adopt the model consid- 
ered in [9] 1101 1161 138) where nodes are able to choose their 
transmission range from a restricted set F. 

A fundamental class of algorithmic problems arising from 
ad-hoc wireless networks consists in the range assignment 
problems: find a transmission range assignment r : S ^ T 
such that (1) the resulting communication graph satisfies 
a given connectivity property 11, and (2) the energy cost 
cost(r) = X^r(s)'^ of the assignment is minimized (see )17l 

m)- 

Several research works [l] 1121 |8l [17] have been devoted to 
the MiN Energy Broadcast problem where 11 is defined 
as follows: Given a source node s, the communication graph 
has to contain a directed spanning tree rooted at s (a branch- 
ing from s ). Previous theoretical results on MiN Energy 
Broadcast concern worst-case analysis only. This problem 
is known to be NP-hard [12] even when F = {0,/i,Z2} for 
li < h and h is set to any fixed positive constant. The 
most famous approximation algorithm is the M5r-based 
heuristic [IT]. This heuristics works in Q{n^) time and its 
performance analysis has been the subject of several works 
over the last years [121 1181 136) . In [1], it is finally proved 



the tight bound 6 on its approximation ratio. More re- 
cently, a new polynomial-time algorithm is provided in [8] 
that achieves approximation ratio close to 4. This algorithm 
applies a rather complex edge-contraction technique on the 
MST-based solution. Its present best version works in O(n^) 
time and the design of any efficient distributed version seems 
to be a very hard task. 

It is important to observe that the MST-based heuristic is 
"far" from achieving optimal solutions even on a complete 
square grid of n points [5l [1^: its worst-case approxima- 
tion ratio on such grids is not smaller than 3. In [TU], it is 
also experimentally observed that this heuristic has a bad 
behavior when applied to random regular instances such as 
faulty square grids. Furthermore, the MST-based heuristic 
requires a large range set F. 

The above discussion leads us to study MiN Energy Broad- 
cast over random grid networks. Given a y'n x y'n grid of 
points of the Euclidean plane (without loss of generality, ad- 
jacent points are placed at unit distance), each poinlQ i is 
selected as a node of the random grid network independently 
with probability pi. This node probability can be any value 
in the interval [pmin,Pmax] where Pmin and pmax are two ar- 
bitrary positive constants in the interval (0, 1). We remark 
that our random grids are in general non uniform: Random 
grids provide a good model for several ad-hoc and sensor 
networks. On one hand, by varying the pi's values, it is pos- 
sible to model non homogenous input configurations with 
regions of different node densities. On the other hand, the 
grid structure guarantees a minimal distance among nodes: 
this is often a desired property in order to optimize area 
coverage and avoid message collisions. Nevertheless, as dis- 
cussed later, all our results also hold for the standard uni- 
form random distribution (i.e. the random input formed by 
choosing n points independently and uniformly at random 
from a 2-dimensional square) |25l |33] . 

Our results. 1) We provide a lower bound on the energy 
cost of feasible solutions for any range assignment problem 
on random grids where the required property 11 implies the 
existence of a disk cover. We say that a range assignment 
is a (disk) covering assignment if it guarantees that every 
node of the network is within the positive range of some 
node. MiN Energy Broadcast is just one of those impor- 
tant cases requiring covering range assignments. 
Let h be the minimum positive range in P. For any < 
e < 1, if Zi = f2(l) then we prove that the energy cost of 
any covering range assignment is with high probabilitj{f| (in 
short, w.h.p.) at least (1 — e)^. Observe that the lower 
bound tends to n/n for any h such that h — u}{l), so for 
minimal ranges much smaller than the connectivity threshold 

e{Vi^) [nmaisiEi]. 

The proof's technique of the lower bound departs signifi- 
cantly from all those adopted in this topic and uses information- 
theoretic arguments. By using this result, we will prove that 



^For the sake of simplicity, we here assume that points are 
labelled with index i — 1, . . . ,n. 

^Here and in the sequel the term with high probability means 
that the event holds with probability at least i — -^ for some 
constant a > 0. 



the next two algorithms are almost optimal. 

2) We provide a simple and efficient algorithm for random 
grids that uses minimal range h = 0(Vlog n) and returns 
a solution of energy cost not larger than 1.1204^ w.h.p.: In 
virtue of our lower bound, this is very close to the optimum. 
Observe that our lower bound holds for any covering range 
assignment while the upper bound holds for feasible range 
assignments of MiN Energy Broadcast: this implies that, 
for li = 01(1), the extra-cost, due to the required tree con- 
nectivity property, is "almost" negligible in random grids (it 
is still an open problem whether this is in fact negligible). 
Our algorithmic solution works in 0(n log n) time and needs 
a set r of logarithmic size (in n). The range assignment is 
inspired to the one provided in [5] for complete square grids 
(i.e. every point of the grid is a node of the network). How- 
ever, the probabilistic cost analysis of our construction for 
random grids is definitely not related to that in [5]. 

3) It is common opinion that the development of efficient, 
provably-good distributed algorithms is presently the ma- 
jor challenge about range assignment problems [4l 1361 [17] . 
We provide an efficient distributed algorithm for MiN En- 
ergy Broadcast on random grids. We investigate the per- 
formance of the protocol in two different scenarios: single 
broadcast and many-broadcast, i.e., a sequence of consecu- 
tive broadcast operations. In both cases, besides the energy 
cost of the returned range assignment, we consider further 
important complexity aspects that determine the quality of 
a distributed solution. 

- Work Complexity. In the ad-hoc network model, the work 
complexity of a distributed algorithm (i.e. protocol) for MiN 
Energy Broadcast is defined as the sum of the energy 
cost of all transmissions made by the protocol to perform 
the broadcast operation [231 1271 135| . This complexity mea- 
sure thus considers both the cost to construct the range as- 
signment and the cost to use it to broadcast the message 
(the latter being exactly the cost of the range assignment 
defined for centralized algorithms). Since both the above 
energy costs are paid by the nodes, a protocol can be really 
considered energy efficient only if it has a small work com- 
plexity. 

- Energy-Load Balancing. In some real ad-hoc networks 
(such as sensor networks), it is also important to equally 
distribute the energy load to all nodes. For instance, solu- 
tions, assigning large ranges to few nodes, are not feasible in 
scenarios where nodes have limited battery charges. In such 
a case, we aim to design solutions that are well energy-load 
balanced. Notice that, in the many-broadcast scenario, this 
corresponds to maximize network lifetime according to the 
model in 00. 

- (Amortized) Completion Time. Another relevant aspect of 
a broadcast protocol is the completion time, i.e., the number 
of time steps required to complete one broadcast operation. 
In the many-broadcast scenario, the amortized completion 
time is the average completion time for one broadcast oper- 
ation. 

Our aim is to derive a protocol having provably-good perfor- 
mance with respect to all the above complexity aspects. To 
the best of our knowledge, no available protocol has been 
shown to have this overall performance. 
We first define a very simple range assignment where only 



one positive range in F is used, provided that it is not smaller 
than Cy/log n where c is a suitable positive constant (ob- 
serve again that this value is asymptotically equivalent to 
the connectivity threshold). This solution is then shown to 
be w.h.p. feasible and to have an energy cost not larger than 
8n. Thanks to our lower bound, the achieved energy cost 
yields a constant approximation ratio. Moreover, this sim- 
ple range assignment can be constructed and managed by 
an efficient protocol. We assume every node initially knows 
n and its relative position with respect to the grid only. Po- 
sitioning information can be obtained by using GPS systems 
or Ad-Hoc Positioning System (APS) |31) . This assumption 
is reasonable in static ad-hoc networks since every node can 
store once and for all its position in the set-up phase. The 
protocol exploits a fully-distributed pivot-election strategy 
borrowed from 

We prove that the work complexity of the protocol is equiv- 
alent to the energy cost of the centralized version and hence, 
thanks again to our lower bound (clearly, a lower bound for 
the energy cost is also a lower bound for the work complex- 
ity), it achieves a constant approximation ratio as well. It is 
important to emphasize that the best distributed algorithm 
to compute an MST in the ad-hoc model has an expected 
work complexity f2(nlogn) [211 1231 l28l 1291 . By comparing 
this bound with the 0(n) work complexity achieved by our 
protocol, we can state that any MST-based solution [171 [8] 
cannot yield good work complexity in this scenario. Other 
distributed solutions have been considered in the literature 
[4l 1271 l35l 137) . however their performance analysis is based 
on experimental tests only. We also compared the work 
complexity of our protocol to the energy cost of the cen- 
tralized MST-based solution over thousands of random in- 
stances with different sizes and densities. The average per- 
formance ratio between the two solutions is always between 
2 and 3 (see Section [4. l[l thus confirming our analytical re- 
sults. 

Our protocol yields a good energy- load balanced solution: 
there are 0(n/logn) pivots, i.e., the nodes having range 
0( Vlog n) (the remaining nodes have range 0). Further- 
more, thanks to the pivot-election strategy [6], a good energy- 
load balance is also obtained with respect to an arbitrary se- 
quence of broadcast operations, i.e., for the many- broadcast 
scenario. At every new operation, the pivot task is indeed 
assigned to nodes according to a Round Robin rule. We 
show this yields an almost optimal life-time of the network 
according to the energy consumption model in [6113- 
As for the single broadcast scenario, the completion time 
of our protocol is slower than the optimum by a O(logn) 
factor. As for the many-broadcast scenario, when the num- 
ber of broadcast operations is f2(logn), then the amortized 
completion time is optimal. 

Finally, we notice that, by using the technique in [30], our 
protocol can be emulated on the standard uniform random 
distribution [251 I33| . The same holds for the centralized 
algorithm achieving cost 1.1204^ and for the lower bound 
as well. The relative proofs for the uniform distribution are 
easier and, so, they are omitted in this extended abstract. 



Paper's Organization. In Section [21 we provide the proof 
of the lower bound. In Section|3] we describe the centralized 
algorithm yielding almost optimal cost. Finally, Section|4]is 



devoted to the description of our distributed protocol and 
its analysis. 

1.1 Preliminaries 

The square grid R of y'n x y'n points will be indexed from 
1 to n. Without loss of generality, the distance between 
adjacent points is set to 1. To each point i of ii, a probability 
value Pi is assigned such that Pmin < Pi < Pmax where 
Pmin and Pmax are arbitrary constants in (0, 1). We consider 
the random input model TL{R,p\, . . . ,p„) where an instance 
S C _R has probability 



iGS i€R-S 

Observe that this probability distribution is equivalent to 
select each point i £ R independently with probability pi. 
A selected point will be called node. In the sequel, a subset 
S R selected according to the above random distribution 
will be simply called random set. 

A set of disks C — {Di , . . . , Dm} is said to be an Z-cover for 
5 C 7? if the following properties hold: i) All disks of C have 
radius at least I, where I is some positive value, ii) Every 
disk of C has its center on a node of S. Hi) Every node of S 
is covered by some disk of C. 

Observe that a range assignment r : S ^ T = {0, li,l2 ■ ■ ■ ,lk} 
can be represented by the family of disks C — {Ci, . . . , Ce} 
yielded by the positive values of r, and its energy cost is 
defined as 



e 

cost(C) = r'^ where n is the radius of Ci (1) 

i=l 

Furthermore, a feasible range assignment for the MiN En- 
ergy Broadcast problem, with input S and F, uniquely 
determines an ii-cover for S having the same cost. Notice 
that the converse is not true in general. 

2. THE LOWER BOUND 

In this section, we provide a lower bound on the cost of any 
covering range assignment for a random set S C _R. 

Definition 1 Let Pr[R, e, I] be the probability that a random 
set S C _R has an l-cover of cost not larger than (1 — e)^. 

Theorem 2 Let 5,pmin and pmax be three constants such 
that < S < 1 and < Pmin < Pmax < 1. Let S ^ R be any 
random set. Then, for any e with < e < 1, for sufficiently 
large n, and for 



e(l — S)pmin 

it holds that 

Pr[R,e,l] < 2-TTO(i-'')^J'™»""+'°g(4n)+t [.f J +t 



where t - 



100 



+ 1 



e(l - 5)pr, 



The binary string cod{S) encoding S is the concatenation of 
four substrings NUM, CEN, RAD and GOV. 



The above theorem clearly implies our lower bound stated 
in the Introduction and it requires no restriction about the 
transmission-range set F but a lower bound on h = I that 
does not depend on n. In particular, if e is any positive 
constant then, for sufHciently large grids and a sufficiently 
large constant I (so I does not depend on n), Pr[ii, e, /] is 
not larger than the inverse of an exponential function in n. 

The theorem's proof makes use of the following combinato- 
rial result. 

Lemma 3 Let Ri, . . . , Rt be a partition of the n points in 
R and let (ki, . . . ,kt) be any t-tuple of integers such that 
< fcj < |_Rj| = Uj. Then, the number of subsets S of R 
such that \S n Rj\ — kj (1 < j < t) admitting an l-cover C 
with I > ^/e and cost(C) < is at most 

where q = min |^|l<j<t| and \{n, q, e, I, t) = 
-|-log(4n) +t 



a) NUM reports the number m of centers of C 

b) CEN reports information to recover the indices of the 

m nodes of S that are centers in C (we assume that 
the n points in the grid are numbered from 1 to n). 

c) RAD reports information to recover the radii of the m 

nodes in C' . 

d) COV reports information to recover the indices of the 

nodes in S. 



We now explain how these data are encoded and then bound 
the length of each of the four substrings. 



a) The number m of centers in C is at most ]S] < n. Thus 

we encode it by a binary string of fixed length (i.e. 
[log(n -f 1)] ). Hence 

\NUM\ = riog(n + 1)1 < logn 1 (2) 

b) The centers of C are a subset of the n points in R and 

so we encode them by a string of fixed length, i.e. 



We now provide a brief description of the information-theoretic 
approach adopted to prove the above lemma. 

Let 5* be a subset of points of R satisfying the hypothesis of 
the lemma. By exploiting the Z-cover C, we will prove that 
S can be encoded into a binary string cod{S) of length at 
most 



log 2 



•yX{n,q,e,l,t) 



n 

The lemma thus follows since the number of these sets 5* can- 
not exceed the number of binary strings of the above length. 



Proof of Lemma\3} Let 5* be a subset of points of R satis- 
fying the hypothesis of the lemma. Consider the Z-cover C' 
of 5* having the same centers of C and where each radius r in 
C is replaced with a radius r' — \J [r^] . Clearly, this change 
is negligible in terms of cost. 

We now show that, thanks to C', S can be encoded into a 
binary string of length at most 



log 2 



\X{n,q,e,l,t) 



n 

i=i 

Thus the thesis follows by noting that the number of these 
sets S cannot exceed the number of binary strings of the 
above length. 



log 



n 
m 



+ 1 



Since in the cover, each of the m centers has radius at 
least I, it must hold ml^ < cost(C). From the hypoth- 
esis cost(C) < (1 — e)-Kn, we get 



m < 



(1 



(3) 



As for substring CEN, we obtain 



\CEN\ = 



log 



< m log ■ 



< log I - 1 + 1 



(1 — e)n , enP 
1 < -^^ 7^ log ■ 



m nP ° 1 — e 

Observe that in the above inequalities we used 



/eny 
V m / 



enP 



(1- 



since the function (^)^ is increasing in the range 



[l,n]; then, from ((Sjl, 



in the range [1 



c) Let now r^, r-2, . . . be the radii in C' arranged by in- 
creasing order of the indices of the m centers. In order 
to give the information on the radii of C' , we encode 
string [ri]^#[r2]^# . . . ['■m]^# in binary where bit 
is encoded as 00, bit 1 as 11 and the symbol # as 01. 



We thus get 

m 

\RAD\ = 2m + 2^riog(rri2] +1)] 
i=l 
m 

< 2m + 2^riog{ri2 + 2)] 

1=1 

Tn 

< 2)n + 2y^(logn^ +2) 

i=l 

= 21og^ri2 + 6m < 21og ^ — 



' 6m 



2 '"'(g) (l-e)n 
< 21og« 1^ +6m < 4^ -^logl + 6- 

= 4^i^log(V80 (4) 

In the above inequalities, we first used Y\T=\ '''^ — 
(^^^m^) since the product is maximized when all 
factors have the same value. Next, we used ^^2!ii£i^ < 

I since the function I '^"'^^ ' J is increasing in 

the range [1, ^2!li£i]; the value of m is in the range 
[l,cost(C)//^]; mf < cost(C) and cost(C)/;^ < cost(C)/e 
for I > ^/e. Finally, we bounded m using (|3]). 

d) In order to encode the nodes in S, we use t strings. The 
j-th string encodes the kj points of 5* in Rj. The kj 
points of S in Rj covered by C' axe a subset of the n'j 
points in Rj covered by C. Hence, we encode these kj 
points with a binary string of length 



logll 



+ 1 



Since for integers a, b and c where a < b < c \t holds 



that 



< f^r ( I. we get 



\cov\ = Yl 



log ( ( + 1 



+ t 



log n 1 

t k 



< log n 



^(nj-n'-) 



< loge-S^=i«("^--"^) +logn +f 



-i}(n - n') log e + log ]^ (^^^ j + t 
j^i J 



(5) 



where n' is the number of points of R covered by C'. 
We now give an upper bound for n'. 

Let X be the number of points of R covered by a disk D 
of radius r and, for each of these points, consider the 
square of area 1 centered in the point. These t squares 
are disjoint and are covered by the disk obtained by 
extending the radius r of disk _D to r + 1/ y/2 . So, the 
number of points of R covered by a disk of radius r is 
bounded by 7r(r- + l/\/2)'^. Moreover, it holds that 

= vTr^ < Vn^ + 1 < ^« + ^ 

where the last inequality follows since ri > I. We thus 
obtain 



n' < 



/ - / 1 1 \2 /I 1 \ ™ 
I"!) n + 1 ^ m + 2 1 = > r, 



We can now use Holder's inequality and obtain 

mm 



1=1 i = l 



\/ m ■ cost(C) 



Since m < cost(C)//'^, we get 
ri < 7rcost(C) I 1 + f i + 4 + 2f— + 



1 1 \^ 

n 



< (1 - e) IH + ^ , 

From ([5]), we get 



\COV\ < -gloge( l-(l-.)(l + i^ + -l^y 



(6) 



We now combine the bounds in ([2]), (Q, Q and ([6]) (respec- 
tively on the lengths of NUM, CEN, RAD and GOV) and 
obtain 

IcodCS)! < logn + l + -^ ?r-^°S- + 1 



4ii-^iog(v^o+iogn(:^- )+* 



,loge(l-{l-e)(l + i, + -Ly 



log(4n) + log n (.)+*+ ( l°g ■ 

i=l 



" 1 



gloge l-(l-e) 1 



1 1 



2/2 ' v^i) 



Proof of Theorem^ We assume 
lo- ''i^ 



n > 



1000 



9 



5(1 - 5)p„ 



For any S, with 5* C _R, define the binary function x 
follows 



1 if S has an /-cover of cost at most (1 — e)n/n 
otherwise 



Clearly, it holds that 

Pr[JZ,e,/]= ^P«(S)x.,i(S) (7) 

SCR 

Let us partition R into t regions Ri, R2, . . . , Rt where \ Rj\ = 
rij such that for 1 < j < t 



^ < j"— 1 and i?j = < Ui + k\l < k < n 



Define fij as the expected number of points in j, i.e., /i-, = 
Sigj? Pi- Let be the family of subsets of R having, in 
each region, a number of points not too small w.r.t. the 
expected number, i.e, 

T={S (£2'^\ \SnR,\ > (1 - S)fi,,l< j < t} 

From (O we get 

Pr[i?,e,/] = ^ Pb(5)xm(5)+ J2 PRiS)x.,i{S) (8) 

We start giving an upper bound on the first addend of the 
right-hand of the above equation. Let 

A = {k = {ki,...,kt) \ keZ' 

and (1 — 5)jj,j < kj < nj ,1 < j < t} 

and, for each k £ A, define 

J^j:^ {S eJ^\ \sn Rj\ = kj,l < j < t} 

Consider any set Sg € J-j: such that PR(S'g) > Pr{S) for 
every S G J-^. Then, 

E E Pfl(5£)x.,.{s) = E^«(S£) E 



< E ^«(^/c)2 
fceA 



n 



(9) 



where the last step follows from Lemma [3] Function A is 
decreasing in q and 



min < — > > min 



l<j<t 1^ rij J i<j<i L n 



(1 - 5)f,j 



> min 



(1 - 5)pminnj 



"L<j<t y rij 

(1 - &)Pr, 



We thus get 



^ 2^(".(i-5)p™,„,E.i,t) ^ J2 PniSj:) (10) 

fegASG^g 

Assume without loss of generality that the points in R are 
numbered in increasing order w.r.t. their probability i.e. 

< p,nin = pi < P2 < ■ ■ - Pn — Pmax < 1. Let go = Pmin < 

<li < <12 ■ ■ ■ < qt = Pmax where q, = max{pi|i e Rj}, for 

1 < j < t. Consider any set S G jTg, then 



= u( u u i^~pA = u( n 

j=i \ieSgnRj ieRj-Sg J j=i \isSnRj 

\ 



n n 



Pi 



( 



<n ^fl.(^nij,) n 
.=1 V 



1 - 



n — 



n „T 



9j 



i£{S-S^)nRj '^i'^ i€(Sg-S)nRj ''^ y 



1 - 



(S-S,-)nflj 



1-Qj-i 



1] 



1 - gj 



{S£-S)niij 



sn/«.(-%)n(i^T^ 



rf" 



PRiS) 



1 pmin Pmax 



Pmin 1 Pr) 



(jlOp implies that 

E PRis)x.AS) 



1 Pmin Pmax 
Pmin 1 Pma 



1 Pmin Pmax \ ^ t~\ 



Y,Pr{S)<2^(- 



Pmin 1 Pmax , 

1 Pmin Pmax 
Pmin 1 Pma 



(11) 



Now we provide an upper bound on 



\{n, (1 - &)pmin-. e, i) = (/(O - 5(0)" + log(4n) + t 



where 



and 



^, , 1 — e , 64e7rs; 
/ 0^0 = —5- log 



7(2;) = -(l-<5)p™,„ log e I 1 - (1 - e) ( 1 + ^ + 



Function /(x) is decreasing for x > ■fY~T)^^~~;j it holds 
that 



/(0<(i^r(^-t--l logf64e.r 5 



^(1 — S)Pmin(- 

Moreover, for every a, c > 0, it holds that a log f < f loge. 
Thus, by setting a = and c = (64e7r)6 , we get 

fil) < (1 - log e (12) 

5{en) 6 



Function 3(2;) is increasing for 2: > 1 and, by a simple cal- 
culus, we obtain 



g{l)>- -{1 ~ 5)p,ni„e\oge (13) 



From (IT2)) and (fT3)) . we obtain 



SC.2R-T 



1 - ^ Ph(S) 

t 



< 1- n ( l-e-V^J 



< 


1 - 


li(. 






















1 - 






< 


te~ 







(15) 



Finally, by combining (jS]), (fli)) and (fT5|) . the theorem fol- 
lows. □ 



3. AN ALMOST OPTIMAL SOLUTION 

We now provide an efficient construction of a covering range 
assignment for a random set S C _R of energy cost very 
close to the lower bound (1 — e)n/7r. Then we will trans- 
form it, with additional cost o(n) only, into a feasible broad- 
cast range assignment that uses 0(log(n/ log n)) ranges and 
such that the (positive) smallest among them, i.e. Zi, is 



e(Vtog^). 



A(n, (l-(5)pmi„,e,Z,t) < -Y^(l-(5)pmi„enloge-|-log(4n)-|-t 



Moreover since 



1 /I Pmin Pmax \ , ^ / e\ 
Pmax J 100 



1 /I Pmin 

r log 

' ' \ Pmin r 

(|11[) implies 



The disk covering construction. The construction of the 
covering is recursive and exploits a tiling of the square with 
octagons and triangles. 

The square R of side ^/n is partitioned into four triangles 
and an octagon (see Figure [T]); up to when there exists a tri- 
angle with side C\/log n, it is further on partitioned into five 
triangles (three small and two big triangles) and an octagon 
(see Figure [2]). 

Starting from this partition, it is possible to produce a disk 
covering OCT as follows (in the sequel, a range assignment 
is seen as a disk assignment with centers on nodes in S): 



We now give an upper bound on the second addend in (|8]). 
Let ^ = min{/.ii,/X2, ■ ■ ■ , ^^t} > Pmin[j\, then 



• for each triangle of the partition, if it contains at least 
one node, then one of them is selected as center of a 
disk having radius c^2 log n. Observe that this disk 
covers any other point inside the same triangle. 

• for each octagon, if it contains at least one point that 
is not covered yet, then Lemma [5] implies that there 




Figure 1: The starting configuration of the partition 
of the square into triangles and an octagon. 




Figure 2: The general step of the partition proce- 
dure of the square into triangles and octagons. 



is a node at distance 0{\/log n) to the center of the 
octagon, w.h.p. Let this node be the center of a disk 
having radius r + 0{\/\og n), where r is the radius of 
the disk that circumscribes the considered octagon; the 
introduced disk covers all points in the octagon. 

Theorem 4 Given a random set S, then, w.h.p., disk cov- 
ering OCT has cost 

cost(C'CT) < 1, 12- + o(n) 

Proof. Let 5* be the set of all the octagons in the parti- 
tion, and for each s £ S call the radius of the disk that 
circumscribes octagon s. Denoting by t be the number of 
triangles in the partition, by construction it holds that: 



cost(C) < 2tc^ \ogn + "^{ts + cy^lognf 



(16) 



Let I be the side of the triangles created during the first step 
and let be the side of the first octagon (see Figure [T|. The 



following equations hold: 21 + I' — ^Jn and I' = \f2l. From 
these, we derive: 



I 



2+72 



I' = 



2+V2 



(17) 



The recursive step depicted in Figure [2] produces triangles 
of two different sides and an octagon. Let Xa,Xb and Xc be 
the lengths of the sides of the bigger triangles, the smaller 
triangles and the octagon, respectively. These lengths are 
tied from the following relationships: Xc = V^Xb, Xa = Xb + 
Xc and Xa + 2xb Xc = I, implying: 



V2+1 



Xb = 



i 



(72+1)2 



v/2i 



(72+1)2 



(18) 



From (I17|) and (|18p . the triangles of the partition, generated 
during step i, have side length Xi = , where 



(2+v^)(v^+l)i 

< i < fc and k is the smallest integer value such that 
Xi < cy/\og n, i.e.. 



■ log(i+72) 



;2(2 + \/2)2 1og7 



(19) 



Observe that all octagons (but the first one) of the partition 
are produced by partitioning some triangle of side length 
Xi, < i < k. Denote by r the radius of the disk that 
circumscribes the first octagon, by Vi the radius of the disk 
that circumscribes the octagon produced by partitioning a 
triangle of side length Xi, and by ti the number of such 
triangles. Then, we can rewrite (|16p as follows: 

g-l fe-1 

cost(C) = t-o(logn) + ^^ti(''i^ + 2c\/lognri + c^logn) 

+ (r + clogn)^ (20) 
We remind that the radius of the disk that circumscribes 



a regular octagon having side I is 



I 



So, we can use 



\/2-s/2' 

(|17l) and (|18|) to compute the following values of r and Vi 
respectively, where < i < fc: 



V2n 



(2+v^)y2-v^) 



2~V2 (72+l)»+2 2+V2 



(21) 



In order to compute the value of ti, observe that trivially 
to = 4 (see Figure [1} and ti = 8 (see Figure [2]). At step i, 
ti = 2ti-i + 3ti-2. Unrolling the recursion we get: 



t, = 3'+' + (-1)' < 3'+' + 1 



(22) 



In order to evaluate cost(C), we bound all terms appearing 
in ^ by exploiting ^ and (f22|) : 



n \ ^ 

(2 + ^) (^+1)4 £^ (V2 + 1)2' 

n {V2 + l){4:V2 + 3) 



< 



(2 + y2)(y2 + l)4 
(64 - 45V2)n 



2V2 



4^2 



2 (2 - V2)n 
(r + clogn) = 



+ 0(1/71 log n) 



(23) 



(24) 



By combining Equations (|22|l and (|19|) we obtain 
t = S*"^^ + 1 



< 9 



1 \ log3(v^+l) / n \21og3(V2+l) 



c(2 + ^/2) 



loe n 



ylogn 

where the last step is true because ^ 
thermore, 



21og3(^+l) 



2c^/\ogn'^2i ^i'"* — "(n) 



(25) 
< 1. Fur- 

(26) 



Equation (l22j imphes that Y!i^Q U < t''; Then, from ((25)1 
we get: 



' log n ^ = o(n) 



(27) 



By combining formulas J^SJ, (US), 1211), (123 and (123 we 

conclude that 

(64-45V2)n (2 - ^)n , , ,,„„,n , , 

cost(C) = -^^ j=—!- — h- ^-^ — ho(n) < 1.1204-+o(n) 

4v 2 ^ ^ 



From Covering to Broadcasting. In order to guarantee 
that the produced covering becomes a broadcast, we need 
to connect the source to the disk centers in OCT. We start 
from the source, located in any place of the square, and build 
a chain of disks towards the center of the grid. Thanks to 
LemmalU the maximum radius of such disks can be bounded 
by 0(\/log n), w.h.p. (see Fig. |3}. We now show that the 
additional cost due to this construction turns out to be sub- 
linear. 



connect all the other centers to points already reached by 
the information sent from the source. The total cost due to 
this step is bounded by ij^jO{y/log n). By replacing 

the formulas for tj and Xj we get: 



= 0(yi^)-^(^3^+^(x/2 + 1)^- +^(V2 - 1)^- 

2 + V2 



= 0(yi^)_^e((3(V2 - 1))^+^) 

= 0(yi^)^^( 1 r^)log3^^_\^(V2+l, 

^2 + x/2 c(2 + ^/2)V logn^ 

This cost is sub-linear since it is ©(n""^). It is not hard to 
verify that the above overall construction can be performed 
in 0(n log 7i) time. 



4. AN EFFICIENT DISTRIBUTED PROTO- 
COL 

Let us consider the following simple algorithm to construct 
a broadcast range assignment. Let / be any range in F such 
that I > 2\/2cv'log n where c is the constant determined by 
Lemma |5] below. 

Algorithm cell-alg. 

a. Grid R is partitioned into square cells of side length A = 

1/(2^2). 

b. In every non-empty cell, choose one of its nodes and assign 

range I to it. This node is called the pivot of the cell. 

c. The cell containing the source will have the source as pivot. 

d. All other nodes have range 0. 




Figure 3: Construction of the chain of disks con- 
necting the source to the center of the first disk. 



The proof of the following lemma is a simple application of 
Chernoff 's Bound. 

Lemma 5 Let Pmin, Pmax, and c be three constants such 
that < Pmin < Pmax < 1 and c > l6/pmin • Let S C R be 
a random grid. Consider the partition of R into square cells 
of side length X where c^log n < A < ^/n. Then, a constant 
7 > exists such that every cell contains w.h.p. at least ^X^ 
nodes. Constant 7 can be set as (l/2)pmin. 

It is then easy to prove the following 

Theorem 6 Algorithm CELL-ALG yields a broadcast range 
assignment r that is w.h.p. feasible and its cost satisfies 



cost(r) = — ■ (2\^Xf = 8n 



The cost of the connection between the source and the cen- 
ter of the square is 0(y'rI\/log n) w.h.p. Then we have to 



Thanks to our lower bound in Theorem 12] CELL-ALG yields 
constant approximation. 



Making it in distributed way. Algorithm cell-alg can 
be converted, without paying any extra energy cost, into an 
efficient, energy-load balanced protocol that performs a se- 
quence of broadcast operations. We describe the protocol for 
the many-broadcast scenario and, thus, besides minimizing 
the energy spent by a single broadcast operation, we aim 
to evenly distribute the transmission task among all nodes 
(but the source). 

According to the standard radio communication model [U 
111! I26| , we assume that nodes act in discrete uniform time 
steps and are non spontaneous. However, we assume a 
weaker, local synchronous model: if, at a given time step 
t, the range of a message transmission covers a cell, then, at 
time step t -\- 1, (only) the nodes of that cell are activated 
and, so, they will agree on the same time step. We assume 
that every node v knows the number n of points and its 
relative coordinates in the square grid R. From its relative 
coordinates every node computes a unique local label with 
respect to its cell. These local labels vary from 1 to A^. 
The k-ih message sent by the source is denoted as . Phase 
k consists of the sequence of time steps where Ms, is broad- 
casted. We assume that Mfe contains the value k. 

The protocol performs, in parallel, two tasks: i)\t constructs 
a broadcast communication graph starting from the source 
and ii) transmits the source message along this graph to 
all nodes. The procedure is executed for every broadcast 
operation from source s. Every node keeps a local counter 
counter initially set to —1. 

Procedure BROADCAST(Mfc) 

Source s transmits, with range I, (Mj.,i) where i is the index of 
its cell. 

All nodes (but s): 

• If 5: 7-^^) then (7 is the constant of Lemma [Sjl 

— When a node v receives, for the first time w.r.t. phase 
fc, (Mfc, i) from the pivot of a neighbor cell i, it becomes 
active. 

— An active node, at every time step, increments its local 
counter counter by one and chocks whether its local 
label is equal to the value of its counter. If this is the 
case, it becomes the pivot of its cell and transmits, 
with range I, (mj,, j) where j is the index of its cell. 

— When an active node in cell i receives (Mj.,j), it (so 
the pivot as well) records in a local array P[k] the 
current value of its counter, i.e. the local label of the 
pivot, and becomes inactive. 

• else (i.e. (A; > 7A^)) 

— When a node v receives, for the first time w.r.t. phase 
fe, (Mfc, i) from the pivot of a neighbor cell i, it checks 
if its local label is equal to P{k mod 7A^]. If this is 
the case, it becomes the pivot of its cell and transmits, 
with range I, (m^., j) where j is the index of its cell. 

Fact 7 Even though nodes initially do not know anything 
about each other, all nodes in the same cell are activated 
(and disactivated) at the same time step; so, their local coun- 
ters share the same value at every time step. Furthermore, 
after the first broadcast operations (i.e. phases), all 
nodes in the same cell know the set P of pivots of that cell. 



More precisely, if jo < ji < j2 < . ■ . jk . ■ . are the lo- 
cal labels of the nodes in a cell, then, during the first 7A'^ 
broadcast operations (i.e. phases), the pivot of the cell at 
phase k will be the node having local label jk. 

Procedure Broadcast has the following properties. 

Energy Cost. As for each single broadcast operation. 
Broadcast yields a broadcast range assignment equivalent 
to that of CELL-ALG. So, Theorem [S] holds as well. 

Work Complexity. 

Definitions Let {Gi,G2, ... ,Gh} be the set of all messages 
sent by the nodes according to a protocol P. Then, the work 
complexity of P ts 

h 

ij, where U is the range used to send Gi 

1=1 

The overall number of node transmissions (i.e. the mes- 
sage complexity) of every execution of Broadcast is 8n/Z^. 
Each transmission has range /, so the work complexity is not 
larger than 8n. 

As for the many-broadcast scenario, our lower bound in The- 
orem [2] easily implies that a work fc(l — e){n/n) is w.h.p. 
required to perform a sequence of k broadcasts (since the 
lower bound holds for the energy cost). It follows that our 
protocol achieves an almost optimal work complexity for the 
many-broadcast operation as well. 

Load Balancing and Network Lifetime. The expen- 
sive pivot's task is evenly assigned, w.h.p., to 7A^ nodes 
(see Lemma [5]) in the same cell by using a round robin 
schedule. This is crucial when the number of broadcasts 
increases and nodes have limited battery charge. As for the 
many-broadcast operation, it is possible to show that our 
protocol achieves an almost maximal lifetime according to 
the consumption model in [Tj [6]. In this model, the goal 
is to maximize the lifetime of the network while guarantee- 
ing, at any phase k, a broadcast operation from the source. 
Formally, each node v is initially equipped with a battery 
charge^ B > 0. Whenever a node transmits with range I, its 
battery charge is reduced by amount /3 • l^ where I denotes 
the range assigned to node v and /3 > is a fixed constant 
depending on the adopted technology. We assume /3 = 1, 
however, all our results holds for any /3 > 0. 
Then, the Max Lifetime problem is to maximize the num- 
ber of independent broadcast operations till some node will 
die (i.e. its battery charge becomes 0). In [7], Max Life- 
Time is shown to be NP-hard. 

Theorem 9 Broadcast performs a sequence of indepen- 
dent broadcast operations whose length is only a constant 
factor smaller than the optimum, w.h.p. 

Sketch of proof. We have already observed that the work 
complexity of Broadcast for any single broadcast opera- 
tion is not larger than aopt, where a is a positive constant 

^Here we assume that, at the very beginning, all nodes are 
in the same energy situation. 



and opt is the optimal work complexity. So, the maximal 
number of independent broadcast operations is not larger 
than nB /opt. Thanks to the local round robin strategy in 
every cell, the energy load of the many-broadcast operation 
is well balanced over at least a (large) constant fraction rj of 
all nodes. So the number of broadcast operations perform 
by Broadcast is at least = (v/a)^, w.h.p. □ 

(Amortized) Completion Time. 

Theorem 10 The amortized completion time (i.e. the av- 
erage number of time steps to perform one broadcast opera- 
tion) over a sequence of T broadcast operations is w.h.p. 

0(l^/T + ^/l) 

Sketch of proof. For a single broadcast operation performed 
by Broadcast, we define the delay of a cell as the number 
of time steps from its activation time till the selection of its 
pivot. Observe that the sum of delays introduced by a cell 
during the first jX^ broadcasts is at most = 0(/^). Then, 
the delay of any cell becomes for all broadcasts after the 
first jX^ ones. Moreover, a broadcast can pass over at most 
0[^/n/l) cells. By assuming that a maximal length path 
(this length being 0(07//)) together with maximal cell de- 
lay can be found in each of the first min{7A^, T} broadcasts, 
we can bound the maximal overall delay with 

0{lV^) (28) 

Finally, the number of time steps required by every broad- 
cast without delays is 

0{V^/l) (29) 

since the length of any path on the broadcast tree is 0{^/n/l). 
By combining (|28p and (|29[). we get the theorem bound. □ 

For brevity's sake, the amortized completion time has been 
analyzed without considering the interferences due to colli- 
sions among pivot transmissions [2]. However, in order to 
avoid such collisions, we can further organize Broadcast 
into iterative stages: in every stage, only cells with not col- 
liding pivot transmissions are active. Since the number of 
cells that can interfere with a given cell is constant, this 
further scheduling will increase the overall time by a con- 
stant factor only. This iterative process can be efficiently 
performed in a distributed way since every node knows n 
and its position, so it knows its cell. 

Corollary 11 The completion time of one single broadcast 
operation is 0{ly/n). 

The worst scenario for our protocol occurs when T is small, 
say T = 0(1). Indeed, assume that a transmission range 
I — 0(\/log n) is available in F, then we get an amortized 
completion time 0(^n log n) that is a factor logn larger 
then the optimum. Notice that in this case, the network 
diameter is Q{y/n/ log n) w.h.p. 

Whenever T = n(logn), we instead get 0{^n/ log n) amor- 
tized completion time which is optimal. 



4.1 Experimental results 

In this subsection, we present the experimental results we 
have obtained by running Algorithm CELL-ALG. We have 
generated 1000 instances for every side length 

y/ne {13,20,25,30,50,100} 

and for node-probability p £ {0.2,0.5}. As usual, our im- 
plementation benefits of some parameter tuning and opti- 
mization: the pivot node (but the source node) inside every 
cell is the one closer to the center of the cell and useless, re- 
dundant ranges are removed. These tasks can be performed 
also by the distributed protocol, after the first phase (i.e. for 
t > ■yX^), without paying any extra energy cost since, after 
that time, every node of a cell knows all its cell neighbors. 
Moreover, the transmission range I is selQto \/21ogn, while 
the cell-size parameter A is set to logn. Notice that, accord- 
ing to such choices, the feasibility (i.e., the existence of a 
path from the source node to all other nodes in the induced 
communication graph) is tested too. In Table [1] (columns 
of feasible sol.") , the number of feasible solutions for the 
different combinations of n and p are reported. 

The solution costs of CELL-ALG are compared to the cost of 
the solution returned by the centralized MST-based algo- 
rithm. We remind that while the energy cost of CELL-ALG is 
an upper bound on the work complexity of our distributed 
procedure Broadcast the energy cost of the MST-based 
solution does not provide any information about the work 
complexity of its distributed implementations (this can be 
much larger). 

Table [T] shows, for all chosen values of p and -y/n, the mini- 
mum, average and maximum ratio between the costs of the 
solutions returned by the two algorithms. As for CELL-ALG, 
only the costs of feasible solutions are considered. 
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